Overview

Dataset statistics

Number of variables: 36
Number of observations: 1033
Missing cells: 1642
Missing cells (%): 4.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 290.7 KiB
Average record size in memory: 288.1 B
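
The missing-cell percentage follows directly from the counts in this table. A minimal sketch in plain Python (the variable names are illustrative and not part of the report) recomputes it, assuming every observation contributes one cell per variable:

```python
# Values copied from the "Dataset statistics" table.
n_variables = 36
n_observations = 1033
missing_cells = 1642

# Assumption: the table is rectangular, one cell per (row, variable) pair.
total_cells = n_variables * n_observations

# Share of cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```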

Variable types

Categorical: 9
Boolean: 14
Numeric: 13

Alerts

AWM year 1 is highly correlated with Overall AWM (High correlation)
AWM year 2 is highly correlated with Overall AWM (High correlation)
AWM year 3 is highly correlated with Overall AWM (High correlation)
Overall AWM is highly correlated with AWM year 1 and 4 other fields (High correlation)
First Sit is highly correlated with Second Sit (High correlation)
Second Sit is highly correlated with First Sit (High correlation)
Fails is highly correlated with Overall AWM and 1 other field (High correlation)
Pass is highly correlated with Overall AWM and 1 other field (High correlation)
English is highly correlated with Maths (High correlation)
Maths is highly correlated with English (High correlation)
Overall AWM is highly correlated with AWM year 1 and 1 other field (High correlation)
Fails is highly correlated with Pass (High correlation)
Pass is highly correlated with Fails (High correlation)
Polar 4 Score is highly correlated with Bursary (High correlation)
desertion is highly correlated with Progress (High correlation)
Progress is highly correlated with desertion (High correlation)
A Levels is highly correlated with Btec (High correlation)
Bursary is highly correlated with Polar 4 Score (High correlation)
British is highly correlated with Student Visa (High correlation)
Btec is highly correlated with A Levels (High correlation)
Student Visa is highly correlated with British (High correlation)
Course is highly correlated with UCAS and 1 other field (High correlation)
UCAS is highly correlated with Course and 1 other field (High correlation)
Disability is highly correlated with Bursary (High correlation)
British is highly correlated with English native Language and 1 other field (High correlation)
English native Language is highly correlated with British (High correlation)
SLC is highly correlated with Student Visa (High correlation)
Care Leaver is highly correlated with Refugee (High correlation)
Student Visa is highly correlated with SLC (High correlation)
Refugee is highly correlated with Care Leaver (High correlation)
London Permanent Residence is highly correlated with British (High correlation)
UCAS Points is highly correlated with English and 1 other field (High correlation)
English is highly correlated with UCAS Points and 1 other field (High correlation)
Maths is highly correlated with UCAS Points and 1 other field (High correlation)
Bursary is highly correlated with Disability and 1 other field (High correlation)
Attendance is highly correlated with AWM year 2 and 3 other fields (High correlation)
AWM year 1 is highly correlated with Overall AWM and 2 other fields (High correlation)
AWM year 2 is highly correlated with Course and 5 other fields (High correlation)
AWM year 3 is highly correlated with AWM year 2 and 6 other fields (High correlation)
Overall AWM is highly correlated with Attendance and 7 other fields (High correlation)
Progress is highly correlated with Attendance and 7 other fields (High correlation)
First Sit is highly correlated with Second Sit and 2 other fields (High correlation)
Second Sit is highly correlated with First Sit and 1 other field (High correlation)
Fails is highly correlated with AWM year 3 and 4 other fields (High correlation)
No Submissions is highly correlated with First Sit and 2 other fields (High correlation)
Pass is highly correlated with AWM year 3 and 4 other fields (High correlation)
Re Takes is highly correlated with AWM year 3 and 1 other field (High correlation)
desertion is highly correlated with UCAS and 9 other fields (High correlation)
Ethnicity has 13 (1.3%) missing values (Missing)
British has 71 (6.9%) missing values (Missing)
English native Language has 69 (6.7%) missing values (Missing)
Parent He attendance has 37 (3.6%) missing values (Missing)
Polar 4 Score has 118 (11.4%) missing values (Missing)
Care Leaver has 158 (15.3%) missing values (Missing)
Student Visa has 69 (6.7%) missing values (Missing)
UCAS Points has 54 (5.2%) missing values (Missing)
English has 160 (15.5%) missing values (Missing)
Maths has 161 (15.6%) missing values (Missing)
A Levels has 60 (5.8%) missing values (Missing)
Btec has 109 (10.6%) missing values (Missing)
Bursary has 162 (15.7%) missing values (Missing)
AWM year 2 has 109 (10.6%) missing values (Missing)
AWM year 3 has 268 (25.9%) missing values (Missing)
Second Sit has 208 (20.1%) zeros (Zeros)
Fails has 849 (82.2%) zeros (Zeros)
No Submissions has 423 (40.9%) zeros (Zeros)
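
The percentages in the missing-value alerts are each variable's missing count divided by the 1033 observations. A small sketch in plain Python reproduces them and ranks the variables by how much is missing; the dictionary and the helper `missing_share` are illustrative names, not part of the report:

```python
# Missing-value counts copied from the alerts above.
N_OBSERVATIONS = 1033
missing_counts = {
    "Ethnicity": 13,
    "British": 71,
    "English native Language": 69,
    "Parent He attendance": 37,
    "Polar 4 Score": 118,
    "Care Leaver": 158,
    "Student Visa": 69,
    "UCAS Points": 54,
    "English": 160,
    "Maths": 161,
    "A Levels": 60,
    "Btec": 109,
    "Bursary": 162,
    "AWM year 2": 109,
    "AWM year 3": 268,
}


def missing_share(count, total=N_OBSERVATIONS):
    """Return the missing share as a percentage rounded to one decimal."""
    return round(100 * count / total, 1)


# Rank variables from most to least missing.
ranked = sorted(missing_counts.items(), key=lambda item: item[1], reverse=True)
for name, count in ranked:
    print(f"{name}: {count} ({missing_share(count)}%)")
```

Sorting this way puts AWM year 3 first, which is a useful starting point when deciding which columns need imputation or exclusion.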

Reproduction

Analysis started: 2022-08-16 15:42:28.623929
Analysis finished: 2022-08-16 15:43:11.380465
Duration: 42.76 seconds
Software version: pandas-profiling v3.2.0
Download configuration: config.json

Variables

Course
Categorical

HIGH CORRELATION

Distinct13
Distinct (%)1.3%
Missing2
Missing (%)0.2%
Memory size8.2 KiB
BA
389 
ba
380 
BA Business Management Enterpreneurship and Innovation
86 
BA Business Management
63 
Ba Business Management Finance
39 
Other values (8)
74 

Length

Max length55
Median length2
Mean length10.3986421
Min length2

Characters and Unicode

Total characters: 10721
Distinct characters: 28
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2
Unique (%): 0.2%

Sample

1st rowBA Business Manangement Enterpreneurship and Innovation
2nd rowBA Business Management
3rd rowBA Business Management Enterpreneurship and Innovation
4th rowBA Business Management
5th rowBA Business Management Enterpreneurship and Innovation

Common Values

Value                                                    Count  Frequency (%)
BA                                                       389    37.7%
ba                                                       380    36.8%
BA Business Management Enterpreneurship and Innovation   86     8.3%
BA Business Management                                   63     6.1%
Ba Business Management Finance                           39     3.8%
BA Business Management Marketing                         37     3.6%
BA Business Management International Business            12     1.2%
MBA                                                      10     1.0%
BA (trailing space)                                      6      0.6%
Ba                                                       4      0.4%
Other values (3)                                         5      0.5%
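
Several of these labels differ only in letter case or surrounding whitespace. A minimal sketch of how such variants could be merged before analysis, in plain Python: the `normalise` helper is a hypothetical name, and the entry `"BA "` assumes the 6-row variant carries a trailing space, as the table suggests.

```python
from collections import Counter

# Counts for the short labels copied from the Common Values table.
# Assumption: "BA " is the variant with a trailing space.
raw_counts = {"BA": 389, "ba": 380, "BA ": 6, "Ba": 4, "MBA": 10}


def normalise(label):
    """Trim surrounding whitespace and fold case so variants compare equal."""
    return label.strip().casefold()


# Sum the counts of all labels that normalise to the same key.
merged = Counter()
for label, count in raw_counts.items():
    merged[normalise(label)] += count

print(dict(merged))
```

The same normalisation would apply to the longer course titles, although spelling variants (for example in the sample rows) would need a separate mapping.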

Length

Histogram of lengths of the category
ValueCountFrequency (%)
ba1020
54.2%
business253
 
13.5%
management238
 
12.7%
enterpreneurship89
 
4.7%
and89
 
4.7%
innovation89
 
4.7%
finance39
 
2.1%
marketing38
 
2.0%
international12
 
0.6%
mba11
 
0.6%

Most occurring characters

ValueCountFrequency (%)
n1424
13.3%
a1184
11.0%
e1091
10.2%
B904
 
8.4%
857
 
8.0%
s848
 
7.9%
A608
 
5.7%
i520
 
4.9%
t481
 
4.5%
b380
 
3.5%
Other values (18)2424
22.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter7831
73.0%
Uppercase Letter2031
 
18.9%
Space Separator857
 
8.0%
Open Punctuation1
 
< 0.1%
Close Punctuation1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n1424
18.2%
a1184
15.1%
e1091
13.9%
s848
10.8%
i520
 
6.6%
t481
 
6.1%
b380
 
4.9%
u342
 
4.4%
r317
 
4.0%
g279
 
3.6%
Other values (9)965
12.3%
Uppercase Letter
ValueCountFrequency (%)
B904
44.5%
A608
29.9%
M290
 
14.3%
I101
 
5.0%
E89
 
4.4%
F39
 
1.9%
Space Separator
ValueCountFrequency (%)
857
100.0%
Open Punctuation
ValueCountFrequency (%)
(1
100.0%
Close Punctuation
ValueCountFrequency (%)
)1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin9862
92.0%
Common859
 
8.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
n1424
14.4%
a1184
12.0%
e1091
11.1%
B904
9.2%
s848
8.6%
A608
 
6.2%
i520
 
5.3%
t481
 
4.9%
b380
 
3.9%
u342
 
3.5%
Other values (15)2080
21.1%
Common
ValueCountFrequency (%)
857
99.8%
(1
 
0.1%
)1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII10721
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n1424
13.3%
a1184
11.0%
e1091
10.2%
B904
 
8.4%
857
 
8.0%
s848
 
7.9%
A608
 
5.7%
i520
 
4.9%
t481
 
4.5%
b380
 
3.5%
Other values (18)2424
22.6%

UCAS
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
True
938 
False
95 
ValueCountFrequency (%)
True938
90.8%
False95
 
9.2%

25 Above
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
no
868 
yes
161 
no
 
4

Length

Max length3
Median length2
Mean length2.159728945
Min length2

Characters and Unicode

Total characters: 2231
Distinct characters: 6
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st rowno
2nd rowno
3rd rowno
4th rowyes
5th rowno

Common Values

Value                Count  Frequency (%)
no                   868    84.0%
yes                  161    15.6%
no (trailing space)  4      0.4%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
no872
84.4%
yes161
 
15.6%

Most occurring characters

ValueCountFrequency (%)
n872
39.1%
o872
39.1%
y161
 
7.2%
e161
 
7.2%
s161
 
7.2%
4
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2227
99.8%
Space Separator4
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n872
39.2%
o872
39.2%
y161
 
7.2%
e161
 
7.2%
s161
 
7.2%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2227
99.8%
Common4
 
0.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
n872
39.2%
o872
39.2%
y161
 
7.2%
e161
 
7.2%
s161
 
7.2%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII2231
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n872
39.1%
o872
39.1%
y161
 
7.2%
e161
 
7.2%
s161
 
7.2%
4
 
0.2%

Disability
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
False
967 
True
 
66
ValueCountFrequency (%)
False967
93.6%
True66
 
6.4%

Ethnicity
Categorical

MISSING

Distinct6
Distinct (%)0.6%
Missing13
Missing (%)1.3%
Memory size8.2 KiB
White
501 
Asian
279 
Black/Black British African
159 
Other ethnic background
76 
Other Black Background
 
3

Length

Max length27
Median length5
Mean length9.851960784
Min length5

Characters and Unicode

Total characters: 10049
Distinct characters: 25
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st rowAsian
2nd rowWhite
3rd rowAsian
4th rowWhite
5th rowAsian

Common Values

Value                        Count  Frequency (%)
White                        501    48.5%
Asian                        279    27.0%
Black/Black British African  159    15.4%
Other ethnic background      76     7.4%
Other Black Background       3      0.3%
Mixed White and Asian        2      0.2%
(Missing)                    13     1.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
white503
33.5%
asian281
18.7%
black/black159
 
10.6%
british159
 
10.6%
african159
 
10.6%
other79
 
5.3%
background79
 
5.3%
ethnic76
 
5.1%
black3
 
0.2%
mixed2
 
0.1%

Most occurring characters

ValueCountFrequency (%)
i1339
13.3%
a842
 
8.4%
t817
 
8.1%
h817
 
8.1%
e660
 
6.6%
c635
 
6.3%
n597
 
5.9%
W503
 
5.0%
B483
 
4.8%
482
 
4.8%
Other values (15)2874
28.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter7901
78.6%
Uppercase Letter1507
 
15.0%
Space Separator482
 
4.8%
Other Punctuation159
 
1.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i1339
16.9%
a842
10.7%
t817
10.3%
h817
10.3%
e660
8.4%
c635
8.0%
n597
7.6%
r476
 
6.0%
s440
 
5.6%
k400
 
5.1%
Other values (8)878
11.1%
Uppercase Letter
ValueCountFrequency (%)
W503
33.4%
B483
32.1%
A440
29.2%
O79
 
5.2%
M2
 
0.1%
Space Separator
ValueCountFrequency (%)
482
100.0%
Other Punctuation
ValueCountFrequency (%)
/159
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin9408
93.6%
Common641
 
6.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
i1339
14.2%
a842
 
8.9%
t817
 
8.7%
h817
 
8.7%
e660
 
7.0%
c635
 
6.7%
n597
 
6.3%
W503
 
5.3%
B483
 
5.1%
r476
 
5.1%
Other values (13)2239
23.8%
Common
ValueCountFrequency (%)
482
75.2%
/159
 
24.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII10049
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i1339
13.3%
a842
 
8.4%
t817
 
8.1%
h817
 
8.1%
e660
 
6.6%
c635
 
6.3%
n597
 
5.9%
W503
 
5.0%
B483
 
4.8%
482
 
4.8%
Other values (15)2874
28.6%

Gender
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
Male
636 
Female
391 
Female
 
3
Male
 
3

Length

Max length7
Median length4
Mean length4.768635044
Min length4

Characters and Unicode

Total characters: 4926
Distinct characters: 7
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st rowMale
2nd rowMale
3rd rowMale
4th rowFemale
5th rowMale

Common Values

Value                    Count  Frequency (%)
Male                     636    61.6%
Female                   391    37.9%
Female (trailing space)  3      0.3%
Male (trailing space)    3      0.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
male639
61.9%
female394
38.1%

Most occurring characters

ValueCountFrequency (%)
e1427
29.0%
a1033
21.0%
l1033
21.0%
M639
13.0%
F394
 
8.0%
m394
 
8.0%
6
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter3887
78.9%
Uppercase Letter1033
 
21.0%
Space Separator6
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e1427
36.7%
a1033
26.6%
l1033
26.6%
m394
 
10.1%
Uppercase Letter
ValueCountFrequency (%)
M639
61.9%
F394
38.1%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4920
99.9%
Common6
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e1427
29.0%
a1033
21.0%
l1033
21.0%
M639
13.0%
F394
 
8.0%
m394
 
8.0%
Common
ValueCountFrequency (%)
6
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII4926
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e1427
29.0%
a1033
21.0%
l1033
21.0%
M639
13.0%
F394
 
8.0%
m394
 
8.0%
6
 
0.1%

British
Boolean

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.2%
Missing71
Missing (%)6.9%
Memory size2.1 KiB
True
579 
False
383 
(Missing)
71 
ValueCountFrequency (%)
True579
56.1%
False383
37.1%
(Missing)71
 
6.9%

English native Language
Boolean

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.2%
Missing69
Missing (%)6.7%
Memory size2.1 KiB
True
497 
False
467 
(Missing)
69 
ValueCountFrequency (%)
True497
48.1%
False467
45.2%
(Missing)69
 
6.7%

Parent He attendance
Boolean

MISSING

Distinct2
Distinct (%)0.2%
Missing37
Missing (%)3.6%
Memory size2.1 KiB
False
535 
True
461 
(Missing)
 
37
ValueCountFrequency (%)
False535
51.8%
True461
44.6%
(Missing)37
 
3.6%

Polar 4 Score
Categorical

HIGH CORRELATION
MISSING

Distinct5
Distinct (%)0.5%
Missing118
Missing (%)11.4%
Memory size8.2 KiB
4.0
314 
3.0
223 
5.0
180 
2.0
107 
1.0
91 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters: 2745
Distinct characters: 7
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row4.0
2nd row2.0
3rd row4.0
4th row3.0
5th row4.0

Common Values

Value      Count  Frequency (%)
4.0        314    30.4%
3.0        223    21.6%
5.0        180    17.4%
2.0        107    10.4%
1.0        91     8.8%
(Missing)  118    11.4%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
4.0    314    34.3%
3.0    223    24.4%
5.0    180    19.7%
2.0    107    11.7%
1.0    91     9.9%
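
The same counts appear with two different percentages in this section: one relative to all 1033 observations, the other relative to the 915 non-missing values only. A short sketch in plain Python (variable names are illustrative) shows both denominators side by side:

```python
# Counts copied from the Polar 4 Score tables.
counts = {"4.0": 314, "3.0": 223, "5.0": 180, "2.0": 107, "1.0": 91}
n_observations = 1033
n_missing = 118

# Non-missing values are the sum of the category counts.
n_present = sum(counts.values())

for value, count in counts.items():
    of_all = 100 * count / n_observations   # denominator includes missing rows
    of_present = 100 * count / n_present    # denominator excludes missing rows
    print(f"{value}: {of_all:.1f}% of all rows, {of_present:.1f}% of non-missing rows")
```

Which denominator is appropriate depends on whether the missing scores are treated as a category of their own or simply excluded.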

Most occurring characters

ValueCountFrequency (%)
.915
33.3%
0915
33.3%
4314
 
11.4%
3223
 
8.1%
5180
 
6.6%
2107
 
3.9%
191
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1830
66.7%
Other Punctuation915
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0915
50.0%
4314
 
17.2%
3223
 
12.2%
5180
 
9.8%
2107
 
5.8%
191
 
5.0%
Other Punctuation
ValueCountFrequency (%)
.915
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common2745
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.915
33.3%
0915
33.3%
4314
 
11.4%
3223
 
8.1%
5180
 
6.6%
2107
 
3.9%
191
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII2745
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.915
33.3%
0915
33.3%
4314
 
11.4%
3223
 
8.1%
5180
 
6.6%
2107
 
3.9%
191
 
3.3%

SLC
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
yes
734 
no
287 
no
 
12

Length

Max length3
Median length3
Mean length2.722168441
Min length2

Characters and Unicode

Total characters: 2812
Distinct characters: 6
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st rowno
2nd rowyes
3rd rowyes
4th rowyes
5th rowyes

Common Values

Value                Count  Frequency (%)
yes                  734    71.1%
no                   287    27.8%
no (trailing space)  12     1.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
yes734
71.1%
no299
28.9%

Most occurring characters

ValueCountFrequency (%)
y734
26.1%
e734
26.1%
s734
26.1%
n299
10.6%
o299
10.6%
12
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2800
99.6%
Space Separator12
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
y734
26.2%
e734
26.2%
s734
26.2%
n299
10.7%
o299
10.7%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2800
99.6%
Common12
 
0.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
y734
26.2%
e734
26.2%
s734
26.2%
n299
10.7%
o299
10.7%
Common
ValueCountFrequency (%)
12
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII2812
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
y734
26.1%
e734
26.1%
s734
26.1%
n299
10.6%
o299
10.6%
12
 
0.4%

Care Leaver
Categorical

HIGH CORRELATION
MISSING

Distinct4
Distinct (%)0.5%
Missing158
Missing (%)15.3%
Memory size8.2 KiB
no
831 
no
 
24
yes
 
19
no
 
1

Length

Max length3
Median length2
Mean length2.050285714
Min length2

Characters and Unicode

Total characters: 1794
Distinct characters: 6
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st rowno
2nd rowno
3rd rowno
4th rowno
5th rowno

Common Values

Value                          Count  Frequency (%)
no                             831    80.4%
no (trailing space)            24     2.3%
yes                            19     1.8%
no (other whitespace variant)  1      0.1%
(Missing)                      158    15.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
no856
97.8%
yes19
 
2.2%

Most occurring characters

ValueCountFrequency (%)
n856
47.7%
o856
47.7%
25
 
1.4%
y19
 
1.1%
e19
 
1.1%
s19
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1769
98.6%
Space Separator25
 
1.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n856
48.4%
o856
48.4%
y19
 
1.1%
e19
 
1.1%
s19
 
1.1%
Space Separator
ValueCountFrequency (%)
25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1769
98.6%
Common25
 
1.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
n856
48.4%
o856
48.4%
y19
 
1.1%
e19
 
1.1%
s19
 
1.1%
Common
ValueCountFrequency (%)
25
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1794
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n856
47.7%
o856
47.7%
25
 
1.4%
y19
 
1.1%
e19
 
1.1%
s19
 
1.1%

Student Visa
Categorical

HIGH CORRELATION
MISSING

Distinct3
Distinct (%)0.3%
Missing69
Missing (%)6.7%
Memory size8.2 KiB
no
792 
yes
155 
no
 
17

Length

Max length3
Median length2
Mean length2.178423237
Min length2

Characters and Unicode

Total characters: 2100
Distinct characters: 6
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st rowyes
2nd rowno
3rd rowno
4th rowno
5th rowno

Common Values

Value                Count  Frequency (%)
no                   792    76.7%
yes                  155    15.0%
no (trailing space)  17     1.6%
(Missing)            69     6.7%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
no809
83.9%
yes155
 
16.1%

Most occurring characters

ValueCountFrequency (%)
n809
38.5%
o809
38.5%
y155
 
7.4%
e155
 
7.4%
s155
 
7.4%
17
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2083
99.2%
Space Separator17
 
0.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n809
38.8%
o809
38.8%
y155
 
7.4%
e155
 
7.4%
s155
 
7.4%
Space Separator
ValueCountFrequency (%)
17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2083
99.2%
Common17
 
0.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
n809
38.8%
o809
38.8%
y155
 
7.4%
e155
 
7.4%
s155
 
7.4%
Common
ValueCountFrequency (%)
17
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII2100
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n809
38.5%
o809
38.5%
y155
 
7.4%
e155
 
7.4%
s155
 
7.4%
17
 
0.8%

Refugee
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)0.2%
Missing7
Missing (%)0.7%
Memory size2.1 KiB
False
1002 
True
 
24
(Missing)
 
7
ValueCountFrequency (%)
False1002
97.0%
True24
 
2.3%
(Missing)7
 
0.7%

London Permanent Residence
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)0.2%
Missing5
Missing (%)0.5%
Memory size2.1 KiB
True
568 
False
460 
(Missing)
 
5
ValueCountFrequency (%)
True568
55.0%
False460
44.5%
(Missing)5
 
0.5%

UCAS Points
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct60
Distinct (%)6.1%
Missing54
Missing (%)5.2%
Infinite0
Infinite (%)0.0%
Mean109.113381
Minimum72
Maximum168
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum: 72
5-th percentile: 82
Q1: 96
median: 104
Q3: 120
95-th percentile: 152
Maximum: 168
Range: 96
Interquartile range (IQR): 24

Descriptive statistics

Standard deviation20.20580245
Coefficient of variation (CV)0.1851817098
Kurtosis0.8416214006
Mean109.113381
Median Absolute Deviation (MAD)11
Skewness0.9814397744
Sum106822
Variance408.2744527
MonotonicityNot monotonic
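
Several of these statistics can be derived from one another using the standard definitions (range = maximum − minimum, IQR = Q3 − Q1, variance = standard deviation squared, CV = standard deviation / mean). As a consistency check, a minimal sketch in plain Python recomputes them from the reported UCAS Points values; the variable names are illustrative:

```python
import math

# Values copied from the UCAS Points tables in this report.
n_observations = 1033
n_missing = 54
total = 106822                      # "Sum"
minimum, q1, q3, maximum = 72, 96, 120, 168
std = 20.20580245                   # "Standard deviation"

# Only non-missing values contribute to the mean.
n_valid = n_observations - n_missing
mean = total / n_valid

value_range = maximum - minimum     # Range
iqr = q3 - q1                       # Interquartile range
variance = std ** 2                 # Variance
cv = std / mean                     # Coefficient of variation

print(f"n_valid={n_valid}, mean={mean:.6f}")
print(f"range={value_range}, IQR={iqr}")
print(f"variance={variance:.4f}, CV={cv:.6f}")
```

The recomputed figures agree with the reported mean, range, IQR, variance and CV to the precision shown in the report.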
Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
96                 84     8.1%
104                51     4.9%
128                47     4.5%
80                 36     3.5%
120                36     3.5%
112                35     3.4%
84                 35     3.4%
88                 33     3.2%
100                33     3.2%
103                30     2.9%
Other values (50)  559    54.1%
(Missing)          54     5.2%

Value  Count  Frequency (%)
72     4      0.4%
80     36     3.5%
82     22     2.1%
84     35     3.4%
85     1      0.1%
86     10     1.0%
87     5      0.5%
88     33     3.2%
89     7      0.7%
90     6      0.6%

Value  Count  Frequency (%)
168    25     2.4%
162    5      0.5%
160    8      0.8%
155    1      0.1%
153    8      0.8%
152    15     1.5%
148    6      0.6%
146    4      0.4%
144    18     1.7%
136    6      0.6%

English
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct8
Distinct (%)0.9%
Missing160
Missing (%)15.5%
Infinite0
Infinite (%)0.0%
Mean4.924398625
Minimum2
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum: 2
5-th percentile: 3
Q1: 4
median: 5
Q3: 6
95-th percentile: 8
Maximum: 9
Range: 7
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation1.385131692
Coefficient of variation (CV)0.2812793596
Kurtosis0.3617214903
Mean4.924398625
Median Absolute Deviation (MAD)1
Skewness0.7507784704
Sum4299
Variance1.918589804
MonotonicityNot monotonic
Histogram with fixed size bins (bins=8)
Value      Count  Frequency (%)
4          290    28.1%
5          256    24.8%
6          120    11.6%
3          82     7.9%
8          53     5.1%
7          51     4.9%
2          11     1.1%
9          10     1.0%
(Missing)  160    15.5%

Value  Count  Frequency (%)
2      11     1.1%
3      82     7.9%
4      290    28.1%
5      256    24.8%
6      120    11.6%
7      51     4.9%
8      53     5.1%
9      10     1.0%

Value  Count  Frequency (%)
9      10     1.0%
8      53     5.1%
7      51     4.9%
6      120    11.6%
5      256    24.8%
4      290    28.1%
3      82     7.9%
2      11     1.1%

Maths
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct8
Distinct (%)0.9%
Missing161
Missing (%)15.6%
Infinite0
Infinite (%)0.0%
Mean4.774082569
Minimum2
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum: 2
5-th percentile: 3
Q1: 4
median: 5
Q3: 5
95-th percentile: 7
Maximum: 9
Range: 7
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation1.19916459
Coefficient of variation (CV)0.2511822057
Kurtosis0.761532845
Mean4.774082569
Median Absolute Deviation (MAD)1
Skewness0.569859444
Sum4163
Variance1.437995713
MonotonicityNot monotonic
Histogram with fixed size bins (bins=8)
Value      Count  Frequency (%)
4          345    33.4%
5          256    24.8%
6          124    12.0%
7          60     5.8%
3          46     4.5%
2          22     2.1%
8          14     1.4%
9          5      0.5%
(Missing)  161    15.6%

Value  Count  Frequency (%)
2      22     2.1%
3      46     4.5%
4      345    33.4%
5      256    24.8%
6      124    12.0%
7      60     5.8%
8      14     1.4%
9      5      0.5%

Value  Count  Frequency (%)
9      5      0.5%
8      14     1.4%
7      60     5.8%
6      124    12.0%
5      256    24.8%
4      345    33.4%
3      46     4.5%
2      22     2.1%

A Levels
Boolean

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.2%
Missing60
Missing (%)5.8%
Memory size2.1 KiB
True
519 
False
454 
(Missing)
60 
ValueCountFrequency (%)
True519
50.2%
False454
43.9%
(Missing)60
 
5.8%

Btec
Boolean

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.2%
Missing109
Missing (%)10.6%
Memory size2.1 KiB
False
545 
True
379 
(Missing)
109 
ValueCountFrequency (%)
False545
52.8%
True379
36.7%
(Missing)109
 
10.6%
Distinct2
Distinct (%)0.2%
Missing3
Missing (%)0.3%
Memory size2.1 KiB
True
538 
False
492 
(Missing)
 
3
ValueCountFrequency (%)
True538
52.1%
False492
47.6%
(Missing)3
 
0.3%

Bursary
Boolean

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.2%
Missing162
Missing (%)15.7%
Memory size2.1 KiB
False
626 
True
245 
(Missing)
162 
ValueCountFrequency (%)
False626
60.6%
True245
 
23.7%
(Missing)162
 
15.7%

Attendance
Real number (ℝ≥0)

HIGH CORRELATION

Distinct63
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean75.08712488
Minimum20
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum: 20
5-th percentile: 46
Q1: 64
median: 76
Q3: 88
95-th percentile: 97
Maximum: 100
Range: 80
Interquartile range (IQR): 24

Descriptive statistics

Standard deviation15.73841886
Coefficient of variation (CV)0.2096020974
Kurtosis-0.6273441074
Mean75.08712488
Median Absolute Deviation (MAD)12
Skewness-0.3975210015
Sum77565
Variance247.6978283
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
60                 34     3.3%
92                 31     3.0%
95                 29     2.8%
74                 28     2.7%
96                 27     2.6%
90                 27     2.6%
81                 27     2.6%
72                 26     2.5%
65                 25     2.4%
94                 25     2.4%
Other values (53)  754    73.0%

Value  Count  Frequency (%)
20     1      0.1%
25     1      0.1%
40     6      0.6%
41     6      0.6%
42     14     1.4%
43     3      0.3%
44     8      0.8%
45     12     1.2%
46     7      0.7%
47     10     1.0%

Value  Count  Frequency (%)
100    15     1.5%
99     15     1.5%
98     20     1.9%
97     15     1.5%
96     27     2.6%
95     29     2.8%
94     25     2.4%
93     13     1.3%
92     31     3.0%
91     16     1.5%

AWM year 1
Real number (ℝ≥0)

HIGH CORRELATION

Distinct56
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean58.72700871
Minimum30
Maximum85
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum: 30
5-th percentile: 40
Q1: 46
median: 58
Q3: 71
95-th percentile: 82
Maximum: 85
Range: 55
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation14.24729732
Coefficient of variation (CV)0.2426021287
Kurtosis-1.118926416
Mean58.72700871
Median Absolute Deviation (MAD)12
Skewness0.1421297796
Sum60665
Variance202.9854811
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
41                 38     3.7%
44                 34     3.3%
43                 34     3.3%
45                 34     3.3%
47                 31     3.0%
46                 29     2.8%
80                 29     2.8%
42                 28     2.7%
40                 27     2.6%
50                 26     2.5%
Other values (46)  723    70.0%

Value  Count  Frequency (%)
30     5      0.5%
31     4      0.4%
32     5      0.5%
33     5      0.5%
34     4      0.4%
35     5      0.5%
36     6      0.6%
37     2      0.2%
38     6      0.6%
39     6      0.6%

Value  Count  Frequency (%)
85     19     1.8%
84     17     1.6%
83     12     1.2%
82     13     1.3%
81     8      0.8%
80     29     2.8%
79     17     1.6%
78     13     1.3%
77     24     2.3%
76     20     1.9%

AWM year 2
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
MISSING

Distinct: 58
Distinct (%): 6.3%
Missing: 109
Missing (%): 10.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 60.36255411
Minimum: 0
Maximum: 87
Zeros: 2
Zeros (%): 0.2%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 35
Q1: 48
median: 60
Q3: 74
95-th percentile: 83
Maximum: 87
Range: 87
Interquartile range (IQR): 26

Descriptive statistics

Standard deviation: 15.49261678
Coefficient of variation (CV): 0.2566593977
Kurtosis: -0.7365486038
Mean: 60.36255411
Median Absolute Deviation (MAD): 13
Skewness: -0.1855494005
Sum: 55775
Variance: 240.0211748
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
79 | 27 | 2.6%
40 | 27 | 2.6%
55 | 25 | 2.4%
71 | 25 | 2.4%
62 | 23 | 2.2%
49 | 23 | 2.2%
81 | 23 | 2.2%
46 | 22 | 2.1%
83 | 22 | 2.1%
58 | 21 | 2.0%
Other values (48) | 686 | 66.4%
(Missing) | 109 | 10.6%

Value | Count | Frequency (%)
0 | 2 | 0.2%
30 | 3 | 0.3%
31 | 6 | 0.6%
32 | 11 | 1.1%
33 | 9 | 0.9%
34 | 6 | 0.6%
35 | 14 | 1.4%
36 | 7 | 0.7%
37 | 5 | 0.5%
38 | 12 | 1.2%

Value | Count | Frequency (%)
87 | 12 | 1.2%
85 | 15 | 1.5%
84 | 14 | 1.4%
83 | 22 | 2.1%
82 | 18 | 1.7%
81 | 23 | 2.2%
80 | 16 | 1.5%
79 | 27 | 2.6%
78 | 19 | 1.8%
77 | 18 | 1.7%

AWM year 3
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
MISSING

Distinct: 57
Distinct (%): 7.5%
Missing: 268
Missing (%): 25.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 58.43529412
Minimum: 0
Maximum: 85
Zeros: 7
Zeros (%): 0.7%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 36
Q1: 45
median: 58
Q3: 71
95-th percentile: 82
Maximum: 85
Range: 85
Interquartile range (IQR): 26

Descriptive statistics

Standard deviation: 15.73646044
Coefficient of variation (CV): 0.269297189
Kurtosis: 0.175645233
Mean: 58.43529412
Median Absolute Deviation (MAD): 13
Skewness: -0.3348012543
Sum: 44703
Variance: 247.6361872
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
43 | 25 | 2.4%
44 | 24 | 2.3%
41 | 24 | 2.3%
52 | 22 | 2.1%
63 | 20 | 1.9%
45 | 20 | 1.9%
76 | 19 | 1.8%
66 | 19 | 1.8%
65 | 19 | 1.8%
42 | 19 | 1.8%
Other values (47) | 554 | 53.6%
(Missing) | 268 | 25.9%

Value | Count | Frequency (%)
0 | 7 | 0.7%
30 | 2 | 0.2%
31 | 6 | 0.6%
32 | 2 | 0.2%
33 | 5 | 0.5%
34 | 5 | 0.5%
35 | 7 | 0.7%
36 | 10 | 1.0%
37 | 12 | 1.2%
38 | 4 | 0.4%

Value | Count | Frequency (%)
85 | 13 | 1.3%
84 | 15 | 1.5%
83 | 10 | 1.0%
82 | 17 | 1.6%
81 | 6 | 0.6%
80 | 17 | 1.6%
79 | 13 | 1.3%
78 | 12 | 1.2%
77 | 12 | 1.2%
76 | 19 | 1.8%

Overall AWM
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct: 166
Distinct (%): 16.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 58.11777993
Minimum: 20.5
Maximum: 84
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 20.5
5-th percentile: 38
Q1: 51
median: 59.33333333
Q3: 66.33333333
95-th percentile: 74.66666667
Maximum: 84
Range: 63.5
Interquartile range (IQR): 15.33333333

Descriptive statistics

Standard deviation: 11.2515911
Coefficient of variation (CV): 0.1935998091
Kurtosis: -0.2921185457
Mean: 58.11777993
Median Absolute Deviation (MAD): 7.333333333
Skewness: -0.4229072332
Sum: 60035.66667
Variance: 126.5983022
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
59.33333333 | 25 | 2.4%
58.33333333 | 19 | 1.8%
67.33333333 | 17 | 1.6%
60 | 17 | 1.6%
67 | 16 | 1.5%
64 | 16 | 1.5%
60.66666667 | 16 | 1.5%
57 | 15 | 1.5%
61 | 15 | 1.5%
56.33333333 | 15 | 1.5%
Other values (156) | 862 | 83.4%

Value | Count | Frequency (%)
20.5 | 1 | 0.1%
27.33333333 | 1 | 0.1%
27.66666667 | 1 | 0.1%
30 | 5 | 0.5%
31 | 4 | 0.4%
32 | 5 | 0.5%
33 | 5 | 0.5%
34 | 4 | 0.4%
35 | 6 | 0.6%
36 | 8 | 0.8%

Value | Count | Frequency (%)
84 | 1 | 0.1%
82.66666667 | 1 | 0.1%
82 | 2 | 0.2%
81 | 2 | 0.2%
80.66666667 | 1 | 0.1%
80.5 | 1 | 0.1%
80.33333333 | 2 | 0.2%
80 | 3 | 0.3%
79 | 3 | 0.3%
78.5 | 1 | 0.1%

Progress
Boolean

HIGH CORRELATION
HIGH CORRELATION

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count | Frequency (%)
True | 849 | 82.2%
False | 184 | 17.8%

First Sit
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct: 6
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.016456922
Minimum: 1
Maximum: 6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 3
median: 4
Q3: 5
95-th percentile: 6
Maximum: 6
Range: 5
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.303216144
Coefficient of variation (CV): 0.3244690954
Kurtosis: -0.7034862085
Mean: 4.016456922
Median Absolute Deviation (MAD): 1
Skewness: 0.024766854
Sum: 4149
Variance: 1.698372318
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Value | Count | Frequency (%)
3 | 373 | 36.1%
4 | 219 | 21.2%
6 | 189 | 18.3%
5 | 183 | 17.7%
2 | 36 | 3.5%
1 | 33 | 3.2%

Value | Count | Frequency (%)
1 | 33 | 3.2%
2 | 36 | 3.5%
3 | 373 | 36.1%
4 | 219 | 21.2%
5 | 183 | 17.7%
6 | 189 | 18.3%

Value | Count | Frequency (%)
6 | 189 | 18.3%
5 | 183 | 17.7%
4 | 219 | 21.2%
3 | 373 | 36.1%
2 | 36 | 3.5%
1 | 33 | 3.2%

Second Sit
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 6
Distinct (%): 0.6%
Missing: 7
Missing (%): 0.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.834307992
Minimum: 0
Maximum: 5
Zeros: 208
Zeros (%): 20.1%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 2
Q3: 3
95-th percentile: 3
Maximum: 5
Range: 5
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.250885485
Coefficient of variation (CV): 0.6819386331
Kurtosis: -0.8041322648
Mean: 1.834307992
Median Absolute Deviation (MAD): 1
Skewness: -0.01403697631
Sum: 1882
Variance: 1.564714496
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Value | Count | Frequency (%)
3 | 349 | 33.8%
2 | 232 | 22.5%
0 | 208 | 20.1%
1 | 199 | 19.3%
5 | 20 | 1.9%
4 | 18 | 1.7%
(Missing) | 7 | 0.7%

Value | Count | Frequency (%)
0 | 208 | 20.1%
1 | 199 | 19.3%
2 | 232 | 22.5%
3 | 349 | 33.8%
4 | 18 | 1.7%
5 | 20 | 1.9%

Value | Count | Frequency (%)
5 | 20 | 1.9%
4 | 18 | 1.7%
3 | 349 | 33.8%
2 | 232 | 22.5%
1 | 199 | 19.3%
0 | 208 | 20.1%

Fails
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 6
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.5634075508
Minimum: 0
Maximum: 5
Zeros: 849
Zeros (%): 82.2%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 4
Maximum: 5
Range: 5
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.308920203
Coefficient of variation (CV): 2.32322091
Kurtosis: 3.602760227
Mean: 0.5634075508
Median Absolute Deviation (MAD): 0
Skewness: 2.210902342
Sum: 582
Variance: 1.713272098
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Value | Count | Frequency (%)
0 | 849 | 82.2%
2 | 53 | 5.1%
3 | 50 | 4.8%
4 | 39 | 3.8%
5 | 32 | 3.1%
1 | 10 | 1.0%

Value | Count | Frequency (%)
0 | 849 | 82.2%
1 | 10 | 1.0%
2 | 53 | 5.1%
3 | 50 | 4.8%
4 | 39 | 3.8%
5 | 32 | 3.1%

Value | Count | Frequency (%)
5 | 32 | 3.1%
4 | 39 | 3.8%
3 | 50 | 4.8%
2 | 53 | 5.1%
1 | 10 | 1.0%
0 | 849 | 82.2%

No Submissions
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 6
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.234269119
Minimum: 0
Maximum: 5
Zeros: 423
Zeros (%): 40.9%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 1
Q3: 2
95-th percentile: 4
Maximum: 5
Range: 5
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.363742454
Coefficient of variation (CV): 1.104898788
Kurtosis: -0.06660940015
Mean: 1.234269119
Median Absolute Deviation (MAD): 1
Skewness: 0.9492579822
Sum: 1275
Variance: 1.859793482
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Value | Count | Frequency (%)
0 | 423 | 40.9%
1 | 253 | 24.5%
2 | 165 | 16.0%
3 | 96 | 9.3%
4 | 76 | 7.4%
5 | 20 | 1.9%

Value | Count | Frequency (%)
0 | 423 | 40.9%
1 | 253 | 24.5%
2 | 165 | 16.0%
3 | 96 | 9.3%
4 | 76 | 7.4%
5 | 20 | 1.9%

Value | Count | Frequency (%)
5 | 20 | 1.9%
4 | 76 | 7.4%
3 | 96 | 9.3%
2 | 165 | 16.0%
1 | 253 | 24.5%
0 | 423 | 40.9%

Late Submission
Categorical

Distinct: 4
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 8.2 KiB

Value | Count
1 | 424
0 | 409
2 | 175
3 | 25

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 1033
Distinct characters: 4
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
1 | 424 | 41.0%
0 | 409 | 39.6%
2 | 175 | 16.9%
3 | 25 | 2.4%

Length

Histogram of lengths of the category

Category Frequency Plot

Value | Count | Frequency (%)
1 | 424 | 41.0%
0 | 409 | 39.6%
2 | 175 | 16.9%
3 | 25 | 2.4%

Most occurring characters

Value | Count | Frequency (%)
1 | 424 | 41.0%
0 | 409 | 39.6%
2 | 175 | 16.9%
3 | 25 | 2.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1033 | 100.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 424 | 41.0%
0 | 409 | 39.6%
2 | 175 | 16.9%
3 | 25 | 2.4%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1033 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 424 | 41.0%
0 | 409 | 39.6%
2 | 175 | 16.9%
3 | 25 | 2.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1033 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 424 | 41.0%
0 | 409 | 39.6%
2 | 175 | 16.9%
3 | 25 | 2.4%

Pass
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct: 7
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 91.62891255
Minimum: 16.66666667
Maximum: 100
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 16.66666667
5-th percentile: 33.33333333
Q1: 100
median: 100
Q3: 100
95-th percentile: 100
Maximum: 100
Range: 83.33333333
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 19.82447674
Coefficient of variation (CV): 0.2163561281
Kurtosis: 3.843668329
Mean: 91.62891255
Median Absolute Deviation (MAD): 0
Skewness: -2.273931639
Sum: 94652.66667
Variance: 393.0098782
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)
Value | Count | Frequency (%)
100 | 849 | 82.2%
33.33333333 | 52 | 5.0%
50 | 50 | 4.8%
66.66666667 | 39 | 3.8%
83.33333333 | 32 | 3.1%
16.66666667 | 10 | 1.0%
86 | 1 | 0.1%

Value | Count | Frequency (%)
16.66666667 | 10 | 1.0%
33.33333333 | 52 | 5.0%
50 | 50 | 4.8%
66.66666667 | 39 | 3.8%
83.33333333 | 32 | 3.1%
86 | 1 | 0.1%
100 | 849 | 82.2%

Value | Count | Frequency (%)
100 | 849 | 82.2%
86 | 1 | 0.1%
83.33333333 | 32 | 3.1%
66.66666667 | 39 | 3.8%
50 | 50 | 4.8%
33.33333333 | 52 | 5.0%
16.66666667 | 10 | 1.0%

Re Takes
Boolean

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count | Frequency (%)
False | 878 | 85.0%
True | 155 | 15.0%

desertion
Boolean

HIGH CORRELATION
HIGH CORRELATION

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count | Frequency (%)
False | 874 | 84.6%
True | 159 | 15.4%

Interactions

[Figure: 169 pairwise interaction plots (13 × 13 numeric variables); images not reproduced.]

Correlations


Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
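The calculation described above can be sketched in plain Python. This is only an illustration under stated assumptions: the helper names (average_ranks, pearson_r, spearman_rho) and the sample values are hypothetical, tied values receive the mean of their rank positions, and the report itself may rely on a library routine instead.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ss_x = sum((a - mx) ** 2 for a in x)
    ss_y = sum((b - my) ** 2 for b in y)
    return cov / (ss_x * ss_y) ** 0.5


def spearman_rho(x, y):
    """Spearman's rho is Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical values: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]
rho = spearman_rho(x, y)  # close to 1: the relationship is perfectly monotonic
r = pearson_r(x, y)       # below 1: the relationship is not linear
```

The example shows why ρ is preferred for monotonic but nonlinear relationships: the ranks of x and y agree exactly even though the raw values do not fall on a line.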

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
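A minimal sketch of this calculation, which also checks the invariance under changes of location and scale. The function name pearson_r and the sample values are hypothetical and are not taken from the dataset; the 1/(n-1) factors of the covariance and the standard deviations cancel, so they are omitted.

```python
from statistics import mean


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ss_x = sum((a - mx) ** 2 for a in x)
    ss_y = sum((b - my) ** 2 for b in y)
    return cov / (ss_x * ss_y) ** 0.5


# Hypothetical marks for five students.
x = [40, 55, 60, 72, 85]
y = [45, 50, 66, 70, 88]

r = pearson_r(x, y)

# Shift and rescale each variable separately (positive scale factors).
x_scaled = [2 * a + 10 for a in x]
y_scaled = [0.5 * b - 3 for b in y]
r_scaled = pearson_r(x_scaled, y_scaled)  # equal to r up to rounding
```

With a negative scale factor on exactly one variable, the sign of r flips while its magnitude stays the same.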

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
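The pair-counting definition can be sketched directly. The function name kendall_tau_a and the sample values are hypothetical. Note the assumption: this is the simple "tau-a" form from the description, in which tied pairs count as neither concordant nor discordant; library routines such as scipy.stats.kendalltau default to the tie-adjusted tau-b variant, so results can differ when ties are present.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The sign of the product tells whether x and y move the same way.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y; it adds to neither count.
    return (concordant - discordant) / len(pairs)


# Hypothetical values: one pair out of three is out of order.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])  # (2 - 1) / 3
```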

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
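A rough sketch of a bias-corrected Cramér's V computed from a contingency table of counts, following the correction commonly attributed to Bergsma (2013). The function name cramers_v_corrected and the example tables are hypothetical, the code assumes every row and column total is non-zero, and it is not necessarily the exact routine used to produce this report.

```python
def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a list-of-rows contingency table."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic against the independence model.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Subtract the expected value of phi2 under independence, floored at 0.
    phi2_corrected = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    # Shrink the table dimensions correspondingly.
    k_corrected = k - (k - 1) ** 2 / (n - 1)
    r_corrected = r - (r - 1) ** 2 / (n - 1)
    return (phi2_corrected / min(k_corrected - 1, r_corrected - 1)) ** 0.5


# Hypothetical 2x2 tables of counts.
independent = [[10, 20], [20, 40]]  # rows are proportional: V is 0
perfect = [[50, 0], [0, 50]]        # each row maps to one column: V is 1
v_independent = cramers_v_corrected(independent)
v_perfect = cramers_v_corrected(perfect)
```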

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
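One way to read a nullity correlation is as the Pearson correlation between two presence indicators (1 where a value is present, 0 where it is missing). The sketch below assumes that reading; the function name nullity_correlation and the column values are hypothetical, with None standing in for a missing entry.

```python
from statistics import mean


def nullity_correlation(col_a, col_b):
    """Pearson correlation of presence indicators for two columns."""
    a = [0 if v is None else 1 for v in col_a]
    b = [0 if v is None else 1 for v in col_b]
    ma, mb = mean(a), mean(b)
    cov = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    ss_a = sum((p - ma) ** 2 for p in a)
    ss_b = sum((q - mb) ** 2 for q in b)
    if ss_a == 0 or ss_b == 0:
        # A column that is always present (or always missing) has no
        # variation, so the correlation is undefined.
        return float("nan")
    return cov / (ss_a * ss_b) ** 0.5


# Hypothetical columns: the second is missing wherever the first is,
# plus one additional gap.
year_2 = [58, None, None, 43, 49, 46]
year_3 = [43, None, None, None, 59, 43]
corr = nullity_correlation(year_2, year_3)  # 1 / sqrt(2), about 0.71
```

A value near +1 means the two columns tend to be missing together; a value near -1 means one tends to be present when the other is missing.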
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

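(index) | Course | UCAS | 25 Above | Disability | Ethnicity | Gender | British | English native Language | Parent He attendance | Polar 4 Score | SLC | Care Leaver | Student Visa | Refugee | London Permanent Residence | UCAS Points | English | Maths | A Levels | Btec | Previous work | Bursary | Attendance | AWM year 1 | AWM year 2 | AWM year 3 | Overall AWM | Progress | First Sit | Second Sit | Fails | No Submissions | Late Submission | Pass | Re Takes | desertion
0 | BA Business Manangement Enterpreneurship and Innovation | no | no | no | Asian | Male | no | no | yes | 4.0 | no | no | yes | no | yes | 98.0 | 5.0 | 4.0 | yes | no | yes | no | 86 | 85 | 58.0 | 43.0 | 62.000000 | yes | 3 | 3.0 | 0 | 2 | 2 | 100.000000 | yes | no
1 | BA Business Management | no | no | no | White | Male | no | no | yes | 2.0 | yes | no | no | no | no | 101.0 | 5.0 | 5.0 | yes | no | yes | yes | 55 | 40 | 32.0 | NaN | 36.000000 | no | 1 | 2.0 | 5 | 3 | 0 | 83.333333 | no | yes
2 | BA Business Management Enterpreneurship and Innovation | no | no | no | Asian | Male | yes | yes | yes | 4.0 | yes | no | no | no | yes | 129.0 | 4.0 | 4.0 | yes | no | yes | no | 57 | 41 | NaN | NaN | 41.000000 | yes | 6 | 0.0 | 0 | 0 | 0 | 100.000000 | no | yes
3 | BA Business Management | no | yes | no | White | Female | no | no | no | 3.0 | yes | no | no | no | yes | 110.0 | 9.0 | 8.0 | yes | no | yes | no | 48 | 41 | 43.0 | NaN | 42.000000 | yes | 6 | 0.0 | 0 | 0 | 0 | 100.000000 | no | yes
4 | BA Business Management Enterpreneurship and Innovation | no | no | no | Asian | Male | yes | yes | yes | 4.0 | yes | no | no | no | yes | 130.0 | 6.0 | 5.0 | yes | no | yes | no | 83 | 55 | 49.0 | 59.0 | 54.333333 | yes | 4 | 2.0 | 0 | 2 | 0 | 100.000000 | no | no
5 | BA Business Management Enterpreneurship and Innovation | yes | no | no | Asian | Male | yes | yes | yes | 3.0 | yes | no | no | no | yes | 112.0 | 6.0 | 4.0 | no | yes | no | no | 71 | 46 | 46.0 | 43.0 | 45.000000 | yes | 3 | 3.0 | 0 | 0 | 1 | 100.000000 | no | no
6 | BA Business Management Marketing | yes | no | no | White | Male | no | yes | no | 5.0 | no | no | yes | no | no | 89.0 | 6.0 | 5.0 | yes | no | no | no | 96 | 78 | 70.0 | 79.0 | 75.666667 | yes | 4 | 2.0 | 0 | 0 | 2 | 100.000000 | no | no
7 | BA Business Management Enterpreneurship and Innovation | yes | no | no | White | Male | yes | yes | no | 4.0 | yes | no | no | no | yes | 103.0 | 4.0 | 5.0 | yes | no | no | no | 67 | 43 | 85.0 | 61.0 | 63.000000 | yes | 3 | 3.0 | 0 | 3 | 0 | 100.000000 | no | no
8 | BA Business Management Enterpreneurship and Innovation | yes | no | no | White | Male | yes | yes | no | 2.0 | no | no | no | no | yes | 128.0 | 4.0 | 4.0 | no | yes | no | yes | 89 | 76 | 58.0 | 44.0 | 59.333333 | yes | 6 | 0.0 | 0 | 0 | 0 | 100.000000 | no | no
9 | BA Business Management | yes | no | no | White | Female | yes | yes | no | NaN | no | no | no | no | no | 91.0 | 4.0 | 4.0 | no | no | no | no | 92 | 49 | 83.0 | 67.0 | 66.333333 | yes | 6 | 0.0 | 0 | 1 | 1 | 100.000000 | no | no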

Last rows

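(index) | Course | UCAS | 25 Above | Disability | Ethnicity | Gender | British | English native Language | Parent He attendance | Polar 4 Score | SLC | Care Leaver | Student Visa | Refugee | London Permanent Residence | UCAS Points | English | Maths | A Levels | Btec | Previous work | Bursary | Attendance | AWM year 1 | AWM year 2 | AWM year 3 | Overall AWM | Progress | First Sit | Second Sit | Fails | No Submissions | Late Submission | Pass | Re Takes | desertion
1023 | BA | yes | no | no | Asian | Male | no | no | yes | 5.0 | yes | no | no | no | no | 107.0 | 6.0 | 7.0 | no | yes | no | no | 96 | 80 | NaN | NaN | 80.0 | yes | 6 | 0.0 | 0 | 0 | 1 | 100.000000 | no | no
1024 | BA | yes | yes | no | White | Male | yes | NaN | no | 3.0 | yes | no | no | no | no | 103.0 | 5.0 | 6.0 | no | yes | yes | no | 67 | 40 | NaN | NaN | 40.0 | yes | 1 | 5.0 | 0 | 3 | 0 | 100.000000 | no | no
1025 | BA | yes | no | no | NaN | Male | yes | NaN | yes | 4.0 | yes | NaN | no | no | no | 100.0 | 5.0 | 4.0 | yes | no | yes | no | 70 | 53 | NaN | NaN | 53.0 | yes | 6 | 0.0 | 0 | 0 | 1 | 100.000000 | no | no
1026 | BA | yes | yes | no | NaN | Female | no | no | yes | 3.0 | yes | NaN | no | no | no | 113.0 | 3.0 | 6.0 | no | yes | no | no | 64 | 73 | NaN | NaN | 73.0 | yes | 3 | 3.0 | 0 | 2 | 1 | 100.000000 | no | no
1027 | BA | yes | yes | no | NaN | Male | yes | no | yes | 2.0 | yes | NaN | no | no | yes | 118.0 | 5.0 | 5.0 | yes | no | yes | yes | 96 | 68 | NaN | NaN | 68.0 | yes | 3 | 3.0 | 0 | 1 | 0 | 100.000000 | no | no
1028 | BA | yes | no | no | Other ethnic background | Female | no | yes | no | 5.0 | yes | NaN | no | no | no | 102.0 | 4.0 | 4.0 | yes | no | yes | no | 55 | 45 | NaN | NaN | 45.0 | yes | 6 | 0.0 | 0 | 0 | 1 | 100.000000 | no | yes
1029 | BA | yes | no | no | NaN | Male | yes | yes | yes | 4.0 | yes | no | no | no | yes | 109.0 | 4.0 | 4.0 | yes | no | yes | no | 66 | 77 | NaN | NaN | 77.0 | yes | 6 | 0.0 | 0 | 0 | 0 | 100.000000 | no | no
1030 | BA | no | yes | no | Asian | Female | no | no | no | 3.0 | no | no | no | no | no | 104.0 | 6.0 | 5.0 | yes | no | yes | no | 42 | 33 | NaN | NaN | 33.0 | no | 1 | 1.0 | 2 | 4 | 1 | 33.333333 | yes | yes
1031 | BA | no | yes | no | Other ethnic background | Male | no | no | yes | 4.0 | yes | no | no | no | no | 101.0 | 6.0 | 6.0 | no | yes | no | no | 60 | 76 | NaN | NaN | 76.0 | yes | 6 | 0.0 | 0 | 0 | 0 | 100.000000 | no | no
1032 | BA | no | yes | no | Other ethnic background | Female | no | no | no | NaN | no | no | yes | no | no | 104.0 | 8.0 | 4.0 | no | no | no | no | 71 | 80 | NaN | NaN | 80.0 | yes | 6 | 0.0 | 0 | 0 | 0 | 100.000000 | no | no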